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Microbial nonribosomally processed peptides represent a large 
class of natural products including numerous important phar- 
maceutical agents, as well as other representatives that play a 
prevalent role In pathogenicity of certain microorganisms [M. A* 
Marahiel, T, Stachelhaus, and H. D. Mootz (1997). Chenu Rev. 97, 
2651-2673]. Although diverse in structure, nonribosomally syn- 
thesized peptides have a common mode of biosynthesis. They are 
assembled on very large protein templates called peptide synthetases 
that exhibit a modular organization, allowing polymerization of 
monomers in an assembly-line-like mechanism. © 2ooi AcademkPreu 



INTRODUCTION 

Nonribosomal peptide synthetases (NRPSs) are organized 
in iterative functional units called modules that catalyze 
a handful of catalytic reactions leading to peptide forma- 
tion. A minimal elongation module that carries out one 
elongation step comprises at least three catalytic domains: 
one for substrate recognition and activation by adenyla- 
tion [adenylation domain (Conti et ciL, 1997)], one for 
covalent incorporation as thioesters on an enzyme-bound 
4'-phosphopantheinyl (4'PP) moiety [thiolation domain 
(Stachelhaus era/., 1996)], and one for condensation of the 
precursor (condensation domain) leading to formation of a 
peptide concatenated to the 4TP moiety of the next module 
(Stachelhaus et ai, 1998). Insertions of optional catalytic 
domains into a minimal module may result in further sub- 
strate modification including epimerization of a-carbon 
atoms [epimerization domain (Stindl and Keller, 1994)] or 
N-methylation of a-amino groups [N-methylation domain 
(de Crecy-Lagard et cil, 1997)]. Condensation of two 
amino acid precursors can also lead to heterocyclic ring for- 
mation of oxazoline or thiazoline types (Gehring et al„ 
1998b). Such reactions involving side chains of substrate 
amino acids like threonine, serine, or cysteine are accom- 
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plished by inherent peptide synthetase domains called 
cyclization domains (Konz et al., 1997). Release of the pro- 
cessed peptide from the last module*s 4TP moiety is 
implemented by thioesterase-like domains resembling 
analogous domains from fatty acid synthases (Shaw-Reid et 
ai, 1999). Nonribosomal synthesized peptides are released 
from the template as linear, cyclic, or branched cyclic 
molecules. Release of the thioester-bound peptides can also 
be catalyzed by NADPH-dependent reduction to peptidyl 
amino aldehydes (Ehmann et aL, 1999) or peptidyl amino 
alcohols (Billman-Jacobe et al., 1999). Taken into account 
that nonribosomal peptide synthesis is not restricted to 
proteinogenic amino acids — a few hundred carboxy acids 
are known to be incorporated — together with the set of 
possible modifications during assembly an immense number 
of peptides and peptide analogous can be designed by 
nature employing a single group of proteins (Konz and 
Marahiel, 1999). 

Due to the modular arrangement of peptide synthetase 
these biofactories have an enormous size (Weber et al., 
1994)_the largest known peptide synthetase directing the 
synthesis of the immunosuppressive agent cyclosporin 
contains 1 1 modules on a single polypeptide chain and has 
a mass of 1600 kDa. An illustration of the stepwise N- to 
C-ierminal assembly of nonribosomal peptides is given in 
Fig. I, depicting the biosynthesis of bacitracin in Bacillus 
licheniformh (Konz et ai, 1997). In most cases, primary 
sequence of the processed peptide and arrangement of the 
modules display a colinearity underlining the assembly-line 
mechanism (see an exception from this rule in biosynthesis 
of syringomycin below). 

Diversity of nonribosomally processed peptides is not 
restricted to their structure; their biological impact is also 
impressive. Important pharmacological agents like the 
immunosuppressive cyclosporin (Weber et aL^ 1994) and 
the peptide backbones of antibiotics of the vancomycin 
(van Wageningen et ai, 1998), penicillin, and cephalosporin 
(Gutierrez et ai, 1991: Smith et al., 1990) group are 
produced by peptide synthetases (see Table 1). Some 
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peptides play a role in pathogenicity of certain microorga- 
nisms: the hepatotoxins of the cyanobacterium Microcystis 
aeniginosa (Dittmann et aL, 1997) as well as several fungal 
toxins are produced by peptide synthetases (see Table 1) 
(Scott-Craig et al, 1992; Tudzynski et aly 1999). Also, the 
production of ferric-chelating siderophores like vibriobactin 
(Wyckoflf et al, 1997) by enteric microorganisms depends 
on peptide synthetases. Their products therefore play a 
prevalent role in pathogenicity of the organisms causing 
wound infections, plague, cholerae, or tuberculosis. Lipo- 
peptides like surfactin of Bacillus subtilis have also awoken 
interest because of their biotechnological relevance in 
oil-recovery processes (Fiechter, 1992). 

Recently a large number of genes encoding peptide 
synthetases have been identified. An approach taking 
advantage from the ubiquitous presence of highly conserved 
core sequences has been devised to identify unknown 
peptide synthetase genes in several microorganisms (Turgay 
and Marahiel, 1994). Also, genome-sequencing projects of 
model organisms provide us with an increasing number of 
peptide synthetase genes. 

PEPTIDE SYNTHETASES ARE COMPOSED OF 
CATALYTIC DOMAINS 

Sequence comparisons within NRPSs helped to define 
catalytic domains within each module. It could be shown 
that these domains can be considered as semiautonomous 
units that retain their potential of catalyzing part reactions 
of nonribosomal peptide synthesis when overproduced as 
distinct proteins (Mootz and Marahiel, 1997; Stachelhaus 
etaU 1996). 

Compared to the ribosomal protein machinery, NRPSs 
are less specific. Probably the evolutionary force to main- 
tain a given substrate selection is not very strict. The main 
gatekeeper for specificity of NRPSs is the adenylation 
domain that catalyzes the activation of cognate carboxy 
acids as adenylates by hydrolysis of ATP. Adenylation 
domains that are ca. 550 amino acids in size represent there- 
fore a member of the superfamily of adenylate-forming 
enzymes like firefly luciferases and acyl-CoA-Iigases. 
Usually adenylation domains in NRPSs are found as part of 
a larger protein frame but it could be demonstrated that 
they exhibit a comparable catalytic activity also when 
expressed heterologously as separated units (Mootz and 
Marahiel, 1997). The importance of highly conserved 
residues within the A-domains for catalytic efficiency was 
demonstrated by site-directed mutagenesis (Gocht and 
Marahiel, 1994) and photoaffinity labeling experiments 
(Pavela-Vrancic et al, 1994). The most impressive insight 
into the architecture of adenylation domains can be drawn 



from the crystal structure of the first adenylation domain of 
the gramicidin S synthetase GrsA (Conti et al, 1997). 
Although a low homology exists in primary structure of 
NRPS adenylation domains and firefly luciferases. their 
overall three-dimensional structure is almost identical 
(Denessiouk and Johnson, 2000). Because adenylation 
domains of NRPSs share a sequence identity of about 
30-60%, the stmcture of GrsA can be anticipated as 
prototype for all adenylation domains. The crystal structure 
underlines the importance of nine residues in substrate 
recognition. A detailed alignment of these residues in other 
adenylation domains with diverse specificity allowed estab- 
lishment of a nonribosomal code (Stachelhaus et al, 1999). 
Altering the specificity of adenylation domains in a predict- 
able way opens new directions to change the structure of 
natural compounds. 

A small protein domain of ca. 80 residues that follows 
the adenylation domain is called the thiolation domain 
(Stachelhaus et al, 1996). This structural-independent unit 
displays a highly conserved serine residue that serves as the 
attachment site for the x^ofactor 4'-phosphopantetheine. The 
thiol moiety of the cofactor is the carrier for the previously 
activated carboxy acid. In analogy to fatty acid and 
polyketide biosynthesis, this domain is also designated as 
peptidyl carrier protein (PCP). Since it has been elucidated 
that each module contains a PCP domain and therefore a 
4'-phosphopantetheine arm, the initial model of NRPS 
(which suggested only a single, central cofactor) has been 
rendered to the now accepted multiple carrier model (Stein 
et al^ 1996), The recently determined solution structure of a 
PCP domain from the tyrocidine synthetase module TycC3 
revealed a common fold for ACP and PCP domains ( Weber 
et al, 2000). 

The presence of the cofactor 4'-phosphopantetheine— the 
state of a holo-enzyme^is crucial to nonribosomal peptide 
synthesis (Nakano et aL 1992). This specific phosphopan- 
tetheinylation is implemented by CoA-binding enzymes of 
the superfamily of4TP transferases (Lambalot etal, 1996). 
The enzymatic reaction results in phosphate ester formation 
of the hydroxyl group of the invariant serine residue of ACP 
and PCP domains with the 4'-phosphopantetheinyl moiety. 
Genes encoding for 4TP transferases in prokaryotes are 
often found adjacent to peptide synthetase operons: 
nevertheless, when an organism contains two or more 
NRPS systems it seems likely that one 4'PP transferase is 
sufTiciem to modify both of them. 4'PP transferases have 
been the subject of extended biochemical and structural 
investigations: PCP and ACP domains can be transferred to 
holo-form by 4'PP transferases in the presence of CoA and 
magnesium in vitro { Wcinreb et oL, 1 998 1 Also, posttransla- 
tional modification of NRPS to holo-enzyme in vivo was 
showm when coexpression with genes encoding 4'PP 
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Fungal peptide synthetase clusters: 
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Legend: 

t antibiotic; § siderophore; A immunosuppressant; * phytotoxin; # cyctostatic agent. Domains of peptide synthetases in bold: A, adenylation domain; 
T, thiolation don>ain; C, condensation domain; epimerizaiion domain; 2, heterocyclization domain; M, N-methyttransferase domain; Te, 
thioesterase domain; R, reductase domain; F, N-formylase domain. Domains of polyketide synthases: KS, ketosynthase; AT, acyttransferase; MT, 
methyltransferase; KR, ketoreductase; DH. dehydrogenase; ACP, acylcarrierprotein; AaT, amino add transferase; braci^ets indicate truncated 
domains. Substrate amino acids are abbreviated with three-letter-code, modified amino adds are indicated by the use of prefixes; D, D- 
configuration; alto, alto-conflguration: Me, N-methylation; xP, O-phosporylation at position x related to the a carbon atom; h. dehydro; xh, 
hydroxylation at position x related to the ot carbon atom; xf, formylation at position x related to the a carbon atom. Greek letters indicate position of 
peptide bond related to the a carbon atom. Abbreviations of non-proteinogenic amino acids and hydroxy adds: Aad, amtno-adipic acid; Aeo, 2- 
amino-8-oxo-9,l0-epoxy decanoic acid; Bmt, 4-{2-butenyl)-4-methyl-threonine, Dab. 2.4-dlamino-butyric-acid; DHB, dihydroxy benzoate; HIV, 2- 
hydroxy-isovalerat; HPG, 4-hydroxy-phenylglycrne; Mdha, N-methyldehydro-alanine; Orn, ornithine; Pip, pipecolic acid: Sal, salicylate. FA, fatty 
acid. Anabaenopeptolide. B structure from Alternaria maii according to [Higashljima, 1979 #1104]. 
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transferases was performed (Kealey et aU 1998). Certain 
4TP transferases do not exhibit a restrictive substrate speci- 
ficity toward the PCP or ACP domain so that hetero- 
logous thiolation domains can be modified to holo-forms 
(Lanibalot ei al., 1996). The recently determined crystal 
structure of the 4'PP transferases Sfp (28 kDa) of surfactin 
synthetases in B. suhiilis revealed an unexpected intra- 
molecular twofold pseudosymmetry of the protein fReuter 
et ai, 1999). This structure may provide an explanation for 
the observation that the homologous 14-kDa ACPS (ACP- 
synthase of Escherichia coll) may function as homodimer. 

The stepwise N- to C-terminal condensation of the 
thioesterified carboxy acids is accomplished by a domain 
located N-terminal of adenylation domains. The role of this 
condensation domain in the formation of a peptide bond 
has been clearly assigned in an in vitro study using a trun- 
cated dimodular model system (Stachelhaus et ciL, 1998). 
This domain type is ca. 450 amino acids in length. In 
general, the number of condensation domains within a 
peptide synthetase is consistent with the number of peptide 
bonds that are formed (see below). In contrast to adenyla- 
tion and thiolation domains, condensation domains display 
a protein class that has no extended sequence homologies to 
proteins of known mechanisms. The presence of a highly 
conserved motif HHxxDGxS (His motif) that is also found 
in acyl transferases like chloramphenicol acetyltransferases 
and lipoyi transferases suggests a mechanism in which one 
of the histidines may enhance the nucleophilic character of 
the amine nitrogen to facilitate formation of the peptide 
bond (Dc Crecy-Lagard et ai, 1995), The importance of the 
second histidine in catalysis of peptide bond formation 
could be shown by site-directed mutagenesis (Stachelhaus 
et aL 1998). 

Because condensation domains play the predominant 
role in catalysis of peptide bond formation, the knowledge 
about its substrate tolerance is crucial for any attempt to 
manipulate NRPSs. Condensation domains moreover 
guard initiation of peptide bond formation, substrate mis- 
loading, and timing of downstream processes (Linne and 
Marahiel, 2000). Assays to elucidate condensation domain 
flexibihty using an intentionally misloaded NRPS system 
suggested a rigid substrate recognition on acceptor site, 
whereas the donor amino acid is tolerated to a larger degree 
(Bclshaw et aL, 1999). This observation has already been 
applied to construct functional hybrid NRPSs obtained by 
domain fusions (Dockel and Marahiel, 2000; Mootz et al., 
2000). Such designed NRPS systems can be used as biofac- 
tories for a defined enzymatic peptide synthesis. 

Condensation domain sequences adapt to NRPS's 
varying domain arrangements: those that are located in the 
C-terniinus of cpimerization domains are clearly to be 
distinguished from those located in the C-terminus of 
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thiolation domains. Domains involved in initiation of acyl- 
transfer during assembly of iipopeptides represent a third 
group of condensation domains (Doekel and Marahiel, 
unpublished). 

Alternatively, condensation of two intermediate carboxy 
acids can lead to heterocycles of the thiazoline or oxazoline 
type. This type of specialized condensation domain has 
therefore been designated as a cyclization domain (Konz et 
aL, 1997). Cyclization domains in general are found in the 
same domain arrangement as condensation domains. Their 
sufTicient role in formation of heterocycles has been 
demonstrated in vitro in a yersiniabactin biosynthesis 
system (Gehring et aL, 1998b), Cyclization domains can be 
clearly distinguished from homologous condensation 
domains; interestingly, they lack their typical His motif. 
There are only poor data dealing with the mechanism 
of the cyclization reaction (Gehring et (7/., 1998). Thiazoline 
structures as a product of cyclization domains can be 
further oxidized by the action of an optional domain that 
shows homologies to flavin-dependent oxy reductases. This 
domain type that has been identified only recently 
represents a 400-amino-acid-residue protein found as an 
insertion in adenylation domains (Molnar et aL, 2000) (see 
N-methylation domains) or in the C-terminus of thiolation 
domains, respectively (Silakowski et al„ 1999). Whereas the 
formation of thiazol structures therefore is also an intrinsic 
property of NRPSs, reduction to thiazolidine structures (see 
yersiniabactin) is accomplished by an external NADPH- 
dcpendent oxyreductase (Bearden et aL, 1997; Butterton et 
aL, 2000). 

A type of domain with an epimerization activity is found 
as an optional insertion into minimal modules. Epimerization 
of carboxy acids to their D-configuration is accomplished by 
such epimerization domains located between a thiolation 
and a condensation domain (Stachelhaus and Walsh, 2000). 

Another optional insertion in the domain arrangement of 
a minimal module was also found between adenylation and 
thiolation domains conferring N-methylation of substrate 
amino acids (Zocher and Kleinkauf, 1978). Sequence 
analysis of this domain type (ca. 420 amino acids) reveals 
similarities to other 5-adenosylmethionine (SAM)-depen- 
dent methyltransferases (Hacker et aL, 2000). Obviously 
N-modification occurs prior to peptide bond formation 
(HaeseeM/., 1993). 

Only recently a domain type obviously involved in 
N-formylation of starter amino acids has been identi- 
fied (Rouhiainen et aL, 2000). Therefore, this domain 
organization represents a new alternative in initiation of 
nonribosomal peptide synthesis. 

Release of peptide products from modular peptide 
synthetases is usually accomplished by a domain found at 
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the C-terminus of peptide synthetases- This domain type 
that is ca. 250 amino acids in length shows homology to 
thioesterases and was also found in fatty acid and 
polyketide synthases. They contain a GxSxG motif com- 
mon for thioesterases and acyltransferases. The importance 
of this domain in cleavage of a peptide product could be 
shown by in vivo studies (Guenzi et aL 1998b; Schneider 
and Marahiel, 1998). The peptide is transferred to the 
hydroxyl side chain of a serine in the motif GxSxG. This 
ester bond is subsequently cleaved by a nucleophilic attack 
of water (releasing a linear product) or an internal side 
chain of the peptide (releasing a cyclic product) (Shaw-Reid 
et al., 1999), Recent in vitro studies with the recombinant 
isolated thioesterase domain from TycC revealed that 
hydrolysis via cyclization is an intrinsic factor of the 
thioesterase domain itself. This enzymatic domain exhibits a 
broad flexibility toward nonnative substrates that seems 
only to be restricted to the N-terminus of the peptide 
(Trauger et ai, 2000). Obviously, the selective recognition 
of the nucleophilic group prevents the NRPS system from 
the formation of mixed cyclic peptides. In addition, a 
proposed stereoselectivity of the ACV's thioesterase domain 
times and regulates the product outcome (Kallow et qL, 
2000). 

In some biosynthetic gene clusters, external thioesterase- 
like encoding genes were found. Gene disruption studies for 
external thioesterases indicated an important but not essen- 
tial role in product formation (Schneider and Marahiel, 

1998) . 

Alternatively, release of the product from the enzyme can 
be implemented by a reductive step yielding aldehyde or 
alcohol functions at the C-terminus of the peptides 
(Ehmann et al., 1999). These reductase domains employ an 
NAD(P)H-dependent mechanism (Konz and Marahiel, 

1999) . 

In summary, we have described the domain organization 
of peptide synthetases with a minimal module containing 
adenylation, thiolation, and condensation domains that 
bear the potential to form a single peptide bond. Minimal 
modules can be extended by epimerization or N-methyla- 
tion domains. Alternatively, condensation domains can be 
replaced by cyclization domains. For release of final 
products, either thioesterase-like or NA(D)PH-dependent 
reductive domains are utilized. 

DISTINCTIONS BETWEEN BACTERIAL AND 
FUNGAL SYSTEMS— NOVEL CHARACTERISTICS 
OF PEPTIDE SYNTHETASES 

Among the first peptide synthetase genes that have been 
discovered in fungi were the ACV synthetases of Penicilliwn 
chrysogenum (Smith et al., 1990) and the huge cyclosporin 
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synthetase of Tolypodadium inflaiitm (Weber et al, 1994). 
Due to the lack of introns within these large genes, it has 
been suggested that fungi acquired peptide synthetase genes 
by horizontal gene transfer from bacteria. This proposal 
must be revised because recently two fungal peptide 
synthetase genes have been discovered that bear introns: 
the ergotamin synthetase LPSl of Clmiceps purpurea 
(Tudzynski et al., 1999) and a peptide synthetase gene of 
unknown function in Metarhizium anisopliae (Bailey et a/., 
1996), so that peptide synthetase genes may have been 
present in fungi for an evolutionarily longer period than 
expected. In parallel, some characteristics of fungal peptide 
synthetases become obvious that differ from bacterial 
systems: most fungal peptide synthetases consist of one 
protein, whereas bacterial enzymes in most cases are dis- 
tributed on more than one polypeptide chain (Weber et al., 
1994). Fungal peptide synthetases seem to share a distinct 
mechanism of chain termination: only in ACV synthetases is 
the described thioesterase domain present, whereas all other 
yet known fungal systems instead employ a C-terminal con- 
densation domain (so that the nuniber of condensation 
domains is N-\- 1 when is the number of peptide bonds to 
be formed) (Haese et aL 1993). The additional C-domain is 
discussed to accomplish a condensation step that goes along 
with release of the processed peptide. Also, fungal non- 
ribosomal synthesized peptides often contain D-amino 
acids, but the corresponding peptide synthetases contain no 
epimerization domain. In such cases, an external epimerase 
seems to provide a D-amino acid that is incorporated by an 
adcnylationdomainhighlyspecificforD-configured substrates. 

Some peptide synthetases like those of syringomycin 
(Guenzi et al., 1998a) and ferrichrome comprise a domain 
organization that omits adenylation domains (see Table 1 ). 
Module organizations like C-T instead of C-A-T are found. 
Tt has been discussed that such additional truncated 
modules may be served as sites for trans-acylation reactions. 
In syringomycin biosynthesis, the C-terminal module is 
believed to be loaded with threonine that is activated 
on an independent one-module enzyme called SyrB. SyrB 
however is encoded by a gene located upstream of the 
3-gene operon. If this is true, it would be the first example 
that breaks with the so far ubiquitar rule of a colinear 
assembly line of NRPS. The idea of an in trans acylation — a 
sort of cross'cntry in nonribosomal peptide synthetase 
— could also explain the unusual domain organization 
found in yersiniabactin synthetase of Yersinia pestis (see 
Table 1 ). Here, the processed precursor peptide should con- 
tain three cysteine residues but only one adenylation 
domain-activating cysteine is present. Biochemical data 
underline that this single adenylation domain can acylate 
three downstream-located thiolation domains in trans. 
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Domains of C-T organizations are also found in the 
recently characterized biosynthetic cluster for the acylhep- 
tapeptide mycosubtilin in B. subtilis ATCC 6633 (Duitman 
et al., 1999) (see Table I). In addition, the first protein of 
this peptide synthetase system, MycA. is a functional hybrid 
protein that contains parts of a fatty acid synthase and an 
amino acid acyltransferase. Here, MycA is believed to 
catalyze the synthesis of the fatty acid tail that is transferred 
to the first activated amino acid by the action of the amino 
acid transferase. It is not yet clear which role the inter- 
mediate CT domain organization plays: Nevertheless, the 
identification of a transferase engaged in initiating peptide 
synthesis of acylated peptides is new and has not been 
reported before. An analogous amino acid transferase 
coupled with a peptide synthetase has also been identified 
recently as part of the biosynthetic cluster of microcystin in 
M. aeruginosa (Nishizawa et al., 1999). 

The mycosubtilin synthetases MycB and MycC also dis- 
play an unusual domain junction: whereas in intermolecular 
domain arrangements the condensation domain is usually 
found as part of the acceptor protein (an intermolecular 
T-C junction), in MycA and MycB the condensation 
domain is found as part of the donor protein providing an 
intermolecular C-A Junction. Analogous domain arrange- 
ments have so far only been reported in the exochelin 
biosynthesis of Mycobacterium smegmatis (Fiss et aL^ 1994). 

MIXED PEPTIDE SYNTHETASES AND 
POLYKETIDE SYNTHASES 

The striking analogy of the architecture of modular 
polyketide synthases and peptide synthetases has been the 
topic of a recently published review (Cane and Walsh, 
1999). In fact, both systems represent biofactories that are 
combined by nature to yield mixed NRPS-PKS natural 
products which greatly extends the structural diversity of 
products. 

The first identified mixed NRPS-PKS biosynthetic gene 
cluster was that of rapamycin of Streptomyces hydroscopius 
(Schwecke et aL, 1995). It contains an NRPS module for 
pipecolic acid incoiporation into the polyketide. An 
analogous organization has been identified in the FK506 
biosynthetic gene cluster in Streptomyces sp. A//I6548 
(Motamedi and Shafiee, 1998). Interestingly, in both 
systems two condensation domains flanking the adenylation/ 
thiolation domain are found. The crossing of NRPS 
modules with up- and downstream polyketide synthase 
modules is guaranteed apparently by employing this 
domain organization. 

NRPS biosynthetic gene clusters that contain polyketide 
synthase portions have also been identified in Yersinia 
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pestis, responsible for yersiniabactin biosynthesis (Bearden 
et ciL, 1997; Gehring et al„ 1998a). In this system, the mixed 
NRPS/PKS protein Irpl contains an integral PKS module 
probably responsible for the incorporation of a malonyl- or 
acetyl-CoA extender unit linked between two thiazoline/ 
thiazolidine moieties. It would be interesting to evaluate 
questions concerning how the cross-talk of these two 
biosystems is accomplished. 

With the increasing number of identified natural product 
biosynthetic gene clusters, several mixed NRPS/PKS 
systems have recently been detected. The mycobactin NRPS 
of Mycobacterium tiiberculo^iis (Quadri et ai, 1998) 
contains a PKS module and the microcystin biosynthesis of 
M. aeruginosa (Neilan et al., 1999) utilizes a mixed 
NRPS/PKS machinery. The latter cluster contains a three- 
modular polyketide synthase that is probably involved in 
the biosynthesis of an uncommon amino acid based on 
decadienoic acid. The recently characterized biosynthetic 
gene clusters for epothilone and myxothiazol also employ 
mixed NRPS/PKS machinery (Molnar et aU 2000; 
Silakowski c//.. 1999). 



IMITATING NATURES POTENTIAL 



As sequences of genes coding for peptide synthetases 
become readily available, detailed comparative sequence 
alignments were used to find an answer to questions like 
"what defines a module or a domain" and "what are the 
borders of domains?" However, a clear answer to assess 
domain sizes cannot be given in advance. Simple model 
systems to elucidate domain borders must be devised 
(Mootz and Marahiel, 1997; Stachelhaus a/., 1996). Does 
a fusion of an epimerization domain to a recombinant AT- 
domain lead to substrate epimerization regardless of the 
amino acid's nature (Linne and Marahiel, unpublished)? Is 
any processed peptide readily cleaved by fused C-terminal 
thioesterase domains and, if so, how (Gokhale et oL, 1999a; 
Mootz et ai, 2000; Trauger et ai, 2000)? Can recombinant 
model di- or trimodular hybrid systems employing non- 
native condensation domains be used to catalyze peptide 
bond formation (Doekel and Marahiel, 2000; Mootz et ai,, 
2000)? 

To foresee the potential of redesigning (un)natural 
products by engineered peptide synthetases, we will outline 
a strategy of constructing a hybrid peptide synthetase on 
scheme with the potential to produce a small peptide 
derivative. Production of a functional recombinant peptide 
synthetase in an approved surrogate host could overcome 
problems like cultivation or high-level fermentation ( Kealey 
et aLy 1998). This strategy should also facilitate further 
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FIG. Z Knowledge-based strategy for a targeted construction of a hybrid peptide synthetase directing the synthesis of the peptide derivative antipain. 
(A) The toolbox < ] ). Paris of the syringomycin. glycoUpopeptide, and anabaenopeptolide biosynthelic gene clusters from Fsttuhmonas syringae, 
Mycohactei'ium smegwatis, and Anabaemi strain 90, respeciively, are illustrated. Brackets indicate regions for the virtual experiment used below. (B) 
Engineering steps. Site-directed mutagenesis ( III J is used to alter the specificity of the D-Dabadenylation domain to Va! and to make the second condensa- 
tion don'iain condensate the carbomyl-function ( II, indicated by asterisks), (Cj Compiling. Gene segments encoding the domains are fused together, (D) 
A new template. The designed tetramodular template contains an N-termina! formylation domain implementing the N-capping of the peptide (IV). The 
four modules take account for the ordered incorporation and condensation of the peptide portion Phe-Arg-Vat-Arg. The C-termintil reductase (indicated 
by a R) domain cleaves the processed peptide derivative from the enzyme leaving a reduced aldehyde function ( VI). 



genetic manipulations with convenient techniques inherent 
to a surrogate host. 

The strategy employs operations that have already been 
discussed elsewhere (Mootz and Marahiel, 1999) like 
module and domain swapping, change-of-substrate speci 
ficity by mutagenesis, and an induced termination to 
achieve release of a defined shortened product. This strategy 
also involves use of a recently discovered formyl transferase 
(Rouhiainen a ai, 2000). 

Antipain is a potent cathepsin inhibitor produced by 
cultures of Sireptomyces rosevs (Umezawa et aL 1972) (see 
Fig. 2) whose biosynthetic machine is unknown. Its peptide 
portion consists of three L-amino acids with the primary 
sequence Arg-Val-Arg, The N-tenninus is linked to the car- 
boxy group of a N-formylated L-phenylalanine (carbamoyl 
function). The C-terniinus is reduced to an aldehyde. It is 
likely that antipain is produced by a yet unknown peptide 
synthetase. The proposed scenario is fictive but (based on 
present knowledge) should generate the same compound. 

The antipain peptide synthetase backbone should be 
tetramodular (see Fig. 2). The specificity of modules 1 to 4 



should be phenylalanine, arginine, valine, and arginine. To 
construct such a tetramodular template, the following 
hybrid fusions can be employed; The modules requested can 
be ^'extracted" from the syringomycin biosynthesis operon 
from Pseitdomonas syringae (I, see Fig. 2) (Guenzi et ciL, 
1998a) and subsequently fused together in a defined order to 
generate a hybrid template for the production of the 
tetrapepttde Phe-Arg-d-Dab-Arg. Functional hybrid NRPS 
systems have been generated recently (Doekel and 
MarahieL 2000; Stachelhaus et aL, 1995). These artificial 
templates obtained by different types of domain and module 
fusions were shown to retain their potential to form peptides 
with defined structures. A rigid substrate specificity of the 
condensation domain toward the acceptor amino acid 
(Belshaw et aL, 1999) requires the use of the acceptor 
module's condensation domain [here C-A-T fusions, il, see 
Fig. 2 (Mootz et aL, 2000); T-C-A have also been shown to 
yield functional hybrids (Doekel and Marahiel, 2000)]. A 
role of interdomain linkers in maintaining communications 
between hybrid domains has been discussed (Gokhale et aL. 
i999b). 
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Several modifications must be employed whose feasibility 
has been confirmed generally using model NRPS systems: 

(1) The specificity of the third module must be ren- 
dered from o-Dab to Val using site-directed mutagenesis, 
giving a template for thetetrapeptide Phe-Arg-Val-Arg (III, 
see Fig. 2). The elucidation of the nonribosomal code— the 
identification of the active-site residues involved in substrate 
recognition — provides a versatile tool for a predictable 
alteration of substrate specificity (Stachelhaus et al, 1999). 

(2) N-formylation in nonribosomal peptides is 
accomplished by a domain type that has been identified 
only recently (Rouhiainen et al.y 2000). To provide an 
N-formylated starter amino acid phenylalanine, the first 
module should be N-terminal fused to a formylation 
domain (IV, see Fig. 2). On the other hand, the synthesis of 
some N-capped nonribosomal peptides like that of the acyl 
peptides syringomycin (Guenzi et aL, 1998a) and surfactin 
(Cosmina et aL, 1993) is initiated by an N-terminal starter 
condensation domain. 

(3) Some Odomains — for example, those found in 
exochelin (Fiss et aL, 1994; Yu ei aL, 1998) and ACV 
biosynthesis (Smith et al., 1990)— generate d-, or 
^-peptide bonds. To aim at an antipain template, the first 
condensation domain used should preferably involve the 
carbamoyl-funciion for the formation of a peptide bond 
(V, see Fig. 2). The mechanism that governs this sort of 
selectivity has not yet been elucidated (Keating and Walsh, 
1999). 

(4) For release of the processed N-formylated 
tetrapeptide, a domain encoding for a NADPH-dependent 
reductase such as Lys2 involved in the lysine biosynthesis in 
yeasts (Ehmann et aL, 1999) and certain other peptide syn- 
thetase systems (BillmanJacobe et aL, 1999) (see Table 1) 
should be used (VI, see Fig. 2). An induced release of such 
peptide products has been reported in vivo (de Ferra et aL, 
1997; Guenzi et aL, 1998b) and— recently — in vitro upon 
fusion of thioesterase-like domains to upstream modules 
(Mootz etaL, 2000). 

The proposed strategy can analogously be extended to a 
vast set of small peptides also to generate templates for the 
synthesis of vertebrate or industrial important peptides. 

CONCLUSIONS 

An increasing number of NRPS sequences become 
available. Based on these data and recent biochemical 
studies, a solid interpretation of the principles of NRPS's 
architecture and organization can now be stated. The 



potential of this biosynthetic machinery increases by the 
recent identification of integrated enzymatic domains. 
Studying the mechanistic principle of multidomain NRPS 
will consolidate attempts for their manipulation. To govern 
the rules of NRPS opens the way to redesign natural 
products and to assist nature^s complexity. Based on the 
current knowledge, design of defined small peptides on 
request by combinatorially puzzled peptide synthetases 
becomes realistic. 
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